Processing of Top-K Most Influential Location Selection Queries

Authors

  • Rui Zhang
  • Jin Huang
  • Zeyi Wen
  • Jian Chen
  • Kerry Taylor
Abstract

Facility location selection queries help to evaluate the popularity of different facility locations for a to-be-added facility. Such queries have wide applications in marketing and decision support systems. In this report, we propose and investigate a new type of query that retrieves the top-k most influential locations from a candidate set in a given context of customers and existing facilities. The influence in the query, which models the potential popularity of the new facility, is defined as the number of reverse nearest customers the new facility would attract if it were added. Specifically, given a candidate set C, an existing facility set F, and a customer set M, the proposed query returns the top-k candidates in C with the greatest influences. The most naive solution for the query employs a sequential scan on all data sets and is thus expensive and not scalable to large data sets. To improve on this solution, two R-tree-based branch-and-bound algorithms are presented. One of them, named Estimation Expanding Pruning (EEP), uses distance metrics between nodes to tighten the search space, while the other, named Bounding Influence Pruning (BIP), relies on half-plane-style geometric properties to achieve the same goal. Both algorithms follow a best-first access strategy guided by "hints" computed during the pruning, and gradually refine these "hints" as they proceed. BIP generally outperforms EEP since it avoids repeated estimations on F and M. Yet due to their extensive accesses to the R-tree indexes, the worst-case complexity of both algorithms is unsatisfactory, causing their performance to degrade dramatically when the data set grows. To achieve better scalability, an algorithm named Nearest Facility Circle (NFC) is proposed.
Rather than computing all the influence relationships from scratch as EEP and BIP do, NFC first pre-computes the influence relationships between customers and existing facilities, then indexes these relationships with an R-tree, and finally processes the query using multiple cheap point enclosure queries. Furthermore, an NFC join (NFCJ) algorithm is proposed, which constructs an R-tree on the candidate set and shares the common traversal cost of the point enclosure queries by using an R-tree join algorithm. We theoretically and experimentally compare all proposed algorithms. The results show that NFCJ is the best solution for the proposed query.
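
The query definition and the idea behind NFC can be illustrated with a small sketch. This is not the authors' implementation: it assumes 2-D points under Euclidean distance, assumes a customer is attracted only when the candidate is strictly closer than the customer's nearest existing facility (how ties are handled is not stated in the abstract), and replaces the R-tree point enclosure queries with a plain linear scan over the circles. All function names are hypothetical.

```python
import heapq
from math import dist


def build_nfcs(customers, facilities):
    """Pre-compute one nearest facility circle per customer.

    Each circle is centred on the customer and has a radius equal to the
    distance from that customer to its nearest existing facility.
    """
    return [(m, min(dist(m, f) for f in facilities)) for m in customers]


def influence(candidate, nfcs):
    """Count the circles that strictly enclose the candidate.

    A customer whose circle encloses the candidate would be closer to the
    candidate than to any existing facility. Assumption: ties do not count.
    A linear scan stands in for the indexed point enclosure query.
    """
    return sum(1 for center, radius in nfcs if dist(candidate, center) < radius)


def top_k_influential(candidates, facilities, customers, k):
    """Return the k (influence, candidate) pairs with the largest influence."""
    nfcs = build_nfcs(customers, facilities)
    scored = [(influence(c, nfcs), c) for c in candidates]
    return heapq.nlargest(k, scored, key=lambda pair: pair[0])


facilities = [(0, 0), (10, 0)]
customers = [(4, 0), (5, 1), (6, 0), (1, 0)]
candidates = [(5, 0), (0, 1), (20, 0)]

print(top_k_influential(candidates, facilities, customers, k=2))
# → [(3, (5, 0)), (1, (0, 1))]
```

The circles depend only on M and F, so they are built once and reused for every candidate; the scalability described in the abstract would come from answering each enclosure test through an index rather than the scan used here.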

Similar resources

Identifying the Most Influential Data Objects with Reverse Top-k Queries

Top-k queries are widely applied for retrieving a ranked set of the k most interesting objects based on the individual user preferences. As an example, in online marketplaces, customers (users) typically seek a ranked set of products (objects) that satisfy their needs. Reversing top-k queries leads to a query type that instead returns the set of customers that find a product appealing (it belon...

PAUSANIAS: Final activity report

Search engines, such as Google and Yahoo!, provide efficient retrieval and ranking of web pages based on queries consisting of a set of given keywords. Recent studies show that 20% of all Web queries also have location constraints, i.e., also refer to the location of a geotagged web page. An increasing number of applications support location-based keyword search, including Google Maps, Bing Map...

eSPAK: Top-K Spatial Keyword Query Processing in Directed Road Networks

Given a query location and a set of query keywords, a top-k spatial keyword query ranks objects based on the distance to the query location and textual relevance to the query keywords. Several solutions have been proposed for top-k spatial keyword queries in Euclidean space. However, few algorithms study top-k keyword queries in undirected road networks where every road segment is undirected. Ev...

Top-k Spatial Preference Queries in Directed Road Networks

Top-k spatial preference queries rank objects based on the score of feature objects in their spatial neighborhood. Top-k preference queries are crucial for a wide range of location based services such as hotel browsing and apartment searching. In recent years, a lot of research has been conducted on processing of top-k spatial preference queries in Euclidean space. While few algorithms study to...

Continuous Processing of Preference Queries in Data Streams

Preference queries have received considerable attention in the recent past, due to their use in selecting the most preferred objects, especially when the selection criteria are contradictory. Nowadays, a significant number of applications require the manipulation of time evolving data and therefore the study of continuous query processing has recently attracted the interest of the data manageme...

Publication date: 2013